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During the past century, several epidemics of 
human African trypanosomiasis, a deadly disease 
caused by the protist Trypanosoma brucei, have 
afflicted sub-Saharan Africa. Over 10 000 new vic- 
tims are reported each year, with hundreds of 
thousands more at risk. As current drug treat- 
ments are either highly toxic or ineffective, novel 
trypanocides are urgently needed. The T. brucei 
galactose synthesis pathway is one potential ther- 
apeutic target. Although galactose is essential for 
T. brucei survival, the parasite lacks the transport- 
ers required to intake galactose from the environ- 
ment. UDP-galactose 4-epimerase (TbGalE) is 
responsible for the epimerization of UDP-glucose 
to UDP-galactose and is therefore of great interest 
to medicinal chemists. Using molecular dynamics 
simulations, we investigate the atomistic motions 
of TbGalE in both the apo and holo states. The 
sampled conformations and protein dynamics 
depend not only on the presence of a UDP-sugar 
ligand, but also on the chirality of the UDP-sugar 
C4 atom. This dependence provides important 
insights into TbGalE function and may help guide 
future computer-aided drug discovery efforts tar- 
geting this protein. 
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Human African trypanosomiasis (HAT), a disease caused by the pro- 
tist Trypanosoma brucei (T. brucei), directly affects thousands of 
sub-Saharan Africans and indirectly places hundreds of thousands 
more at risk. Current treatments are often ineffective or dangerous, 
necessitating the development of a new generation of HAT thera- 
peutics. 

Disease pathology occurs in two stages. The early hemolymphatic 
stage is characterized by fever, cephalgia, arthralgia, and pruritus. 
Current treatments for early-stage HAT include pentamine and sur- 
amin. However, pentamine, often used to treat the T. brucei gamb- 
iense strain, is associated with hypoglycemia and hypotension, and 
suramin, effective against T. brucei rhodesiense, is associated with 
severe cutaneous reactions, renal failure, anaphylactic shock, and 
neurotoxicity (1-4). 

Once the parasite crosses the blood-brain barrier, the more serious 
and often irreversible symptoms of the neurological phase are mani- 
fest, including disrupted cognition, coordination, and sleep (5). Since 
1949, melarsoprol has been the standard treatment for late-stage 
HAT. Despite its efficacy, a significant number of patients relapse; 
additionally, 5-10% of those who receive treatment develop severe 
encephalopathy, often leading to death (6). In the late 20th century, 
Aventis developed eflornithine, a drug that targets T. brucei orni- 
thine decarboxylase. While eflornithine is significantly less danger- 
ous than melarsoprol (7), it is ineffective against the T. brucei 
rhodesiense subspecies (8). 

As current therapeutics are problematic, medicinal chemists are 
actively seeking to identify novel T. brucei drug targets. The pro- 
teins of the biochemical pathway involved in galactose synthesis 
are excellent candidates. Although T. brucei requires galactose for 
the synthesis of vital glycoproteins (9), it is unable to intake galac- 
tose from the environment. Instead, glucose is acquired via hexose 
transporters (10) and is subsequently converted to galactose. One 
of the proteins in the pathway that effectuates this conversion, 
UDP-galactose 4'-epimerase (7Z)GalE), inverts the stereochemistry of 
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the UDP-glucose C4 carbon atom to produce UDP-galactose. The 
two-step reaction proceeds via a transient ketose intermediate and 
requires rotation of the sugar portion of the ligand for epimerization 
(Figure 1) (11). Initial drug discovery efforts targeting this protein 
have identified several promising inhibitors (12). Unexpectedly, vir- 
tual screening and subsequent experimental validation identified 
several agonists as well. 

This study provides insight into the general mechanisms of 76GalE 
ligand binding. The static lock-and-key model of binding, first pro- 
posed by Emil Fischer in 1894 (13), has been largely abandoned in 
favor of theories that account for receptor flexibility. Specifically, 
the induced-fit and population-shift models have been much 
debated over the course of the past 50 years (14). Induced fit sug- 
gests that apo and hob receptors sample distinct regions of confor- 
mational space; ligand binding itself induces conformational 
changes in the receptor (15). Population shift, on the other hand, 
suggests that an apo protein samples many conformations in solu- 
tion, a subset of which are amenable to ligand binding. Ligands 
bind to rare but amenable receptor conformations and energetically 
stabilize them, causing the population of all conformations to shift 
toward those that can accommodate the ligand (16-19). 




Figure 1: UDP-sugar epimerization. The epimerization reaction 
proceeds via a UDP-ketose intermediate. Epimerization requires a 
180° rotation ('flip') of the sugar portion of the ligand. 



Here, we explore the atomistic dynamics of the 7Z>GalE homodimer 
by investigating the major steps along its reaction coordinate. Using 
molecular dynamics (MD) simulations, we study the motions of 
7ibGalE homodimer in both the apo and hob forms, bound to UDP- 
galactose, UDP-glucose, and the UDP-ketose intermediate. The con- 
formations sampled by the protein depend not only on the presence 
of a UDP-sugar ligand, but also on the chirality of the UDP-sugar 
C4 atom. This dependence provides important insight into 7ibGalE 
function and may help guide future computer-aided drug discovery 
efforts targeting this protein. 

Methods and Materials 

System preparation 

A crystal structure of 7£GalE homodimer [PDB ID: 2CNB (20)] was 
obtained from the Protein Data Bank (21). To generate missing 
loops, the structure was aligned to a model of TbGalE that had 
been created previously (12) based on the 1GY8 TifaGalE structure 
(11). Following alignment, the coordinates of the missing loop 
atoms were copied from the 1GY8 model. All cocrystallized waters 
were retained; PDB2PQR (22) and PROPKA (23,24) were used to 
assign residue protonation states at pH 7.0. Histidine protonation 
states were visually inspected in VMD (25) to ensure optimal hydro- 
gen bonding. The NAD(H) and Y173 protonation states were manu- 
ally assigned to properly mimic the active site configuration needed 
for catalysis (11). 

IMAD+/NADH parameters were obtained from Walker et al. (26,27). 
Accelerys Discovery Studio 2.5 was used to model the structure of 
UDP-galactose by changing the fluorine atom of the 2CNB UDP-4- 
deoxy-4-fluoro-alpha-D-galactose ligand to a hydroxyl group. UDP- 
glucose and the UDP-ketose intermediate were built by altering the 
stereochemistry and hybridization of the UDP-galactose C4 carbon 
atom. Hydrogen atoms were added to the three UDP-sugars using 
Discovery Studio. All ligand partial charges were generated using 
gaussian03, 3 and ligand atoms were parameterized according to the 
GAFF force field (28). 

Receptor atomic parameters and partial charges were assigned 
according to the Amber ff99SB force field (29) using the Amber 10 
xleap module. 15 Sodium ions were subsequently added to bring the 
system to electric neutrality. The protein was then solvated in a 
TIP3P (30) water box that extended 10 A beyond the protein in each 
direction, and additional sodium and chloride ions were added to 
bring the total salt concentration to 20 miM. 



Molecular dynamics simulations 

NAMD 2.6 (31) was used for all minimizations, equilibrations, and 
free-dynamics runs. Minimization and equilibration steps were per- 
formed as described previously by Wang ef al. (28). In brief, each sys- 
tem was minimized in four phases totaling 45 000 minimization 
steps. Hydrogen atoms were relaxed in the first 5000 steps; hydrogen 
atoms and water molecules were relaxed in the next 5000 minimiza- 
tion steps; hydrogen atoms, water molecules, and the atoms of the 
protein backbone were relaxed in the next 10 000 minimization steps; 
and all atoms were relaxed for the last 25 000 minimization steps. 
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For equilibration, 1-ns simulations were performed at 31 OK using 
the final minimized structures as the initial coordinates. Harmonic 
constraints were placed on the atoms of the protein backbones and 
relaxed in a series of four 250-ps steps. The harmonic restraining 
force was weakened from 4.0 kcal/mol/A 2 in the first 250-ps seg- 
ment to 3.0, 2.0, and 1.0 kcal/mol/A 2 in the following steps, 
respectively. Before beginning the productive dynamics simulations, 
each system was checked to ensure that the root-mean-square 
deviation (RMSD) between the equilibrated and preminimization 
structures was <1 A. 

For each of the four systems, a 59-ns MD simulation was then per- 
formed with a 2-fs time step. Bonds with hydrogen atoms were 
constrained using the RATTLE algorithm (32), and water geometries 
were maintained using SETTLE (33), with a bond length error of 
0.0005 A. The temperature bath was kept at 31 OK with Langevin 
dynamics. The pressure was maintained at 1 atm using the Nose- 
Hoover-Langevin piston method (34) with period and decay times 
set at 100 and 50 fseconds, respectively. Long-range electrostatics 
were calculated using Particle mesh Ewald (35). The free-dynamics 
runs were performed on the TACC Ranger supercomputer. A typical 
benchmark on the 102 911 and 102 884 atom systems was 
4.35 nseconds/day of simulation on 96 processors. The system was 
sampled every 1 pseconds, generating a total of 59 000 coordinate 
snapshots. For analysis, every 5th frame was used. Each frame was 
aligned to the first frame of the trajectory by minimizing the alpha 
carbon root-mean-square (RMS) deviation using the RMSD Trajec- 
tory Tool in VMD (25). 



Trajectory clustering 

The monomers of each homodimer simulation were isolated, and 
the two resulting trajectories were concatenated to form a single 
monomeric trajectory. These monomeric trajectories were subse- 
quently clustered using the gromos algorithm as implemented in 
the gromos++ analysis software (36). The alpha carbon atoms in the 
active site, defined as all alpha carbon atoms belonging to a resi- 
due that was within 5 A of the NAD or UDP-sugar in the first 
frame of the trajectory, were used for the mass-weighted RMSD 
clustering. The RMSD cutoff was increased by 0.05 A until the tra- 
jectory clustering yielded fewer than 30 clusters, with over 90% of 
all frames contained in the seven largest clusters. 



Hydrogen bond analysis 

Frames extracted from the simulation every 50 pseconds were used 
for hydrogen bond analysis. The program HBonanza (37) was set to 
identify all persistent hydrogen bonds present in at least 75% of 
the frames. Angle and distance cutoffs of 30° and 3.5 A were used, 
respectively. 



Principal component analysis 

Principal component analysis (PCA) was performed using the ptraj 
module in Amber 10 b . Normalizing each eigenvalue of the covari- 
ance matrix to its total sum yields the percent of all 77jGalE move- 
ments attributable to the corresponding eigenvector. That is, the 
largest eigenvalues correspond to the PC modes that best explain 



the molecular motions sampled by the system trajectories. The prin- 
cipal component projections were visualized using a modified ver- 
sion of the Interactive Essential Dynamics module in VMD (38). 



Identifying highly conserved active site 
residues 

UniProt (39) was used to identify 7142 members of the sugar epim- 
erase family. Only reviewed sequences were considered for subse- 
quent analysis. These sequences were loaded into the MultiSeq 
(40) module of the VMD molecular graphics computer program (25). 
MultiSeq was used to generate a non-redundant set of four repre- 
sentative sequences from this input. ClustalW (41) was then used 
to align these four sequences to the sequence of 7Z>GalE obtained 
from the PDB structure 2CNB (20). A set of active site residues was 
obtained by identifying all receptor residues from the 2CNB 
structure (chain A) that came within 3.5 A of the cocrystallized 
NAD + cofactor and the UDP-sugar substrate. 



HsGalE inhibition and thermal scanning 
fluorimetry assay 

Recombinant wild-type f/sGalE and human UDP-glucose dehydroge- 
nase (ffeUGDH) proteins were expressed in £ coll and purified as 
described previously (42,43). DTP compounds were obtained from 
the NCI/DTP Open Chemical Repository (http://dtp.cancer.gov). 

The rate of the ffeGalE-catalyzed reaction was determined by cou- 
pling it to the oxidation of UDP-glucose by the action of ffeUGDH, 
essentially as previously described (42). Reaction mixtures (20 nM 
HsGalE, 1.2 HsUGDH, 100 //m UDP-Gal, 10 mM NAD + 10 mM 
HEPES-NaOH, pH 8.8, 1% (v/v) DMSO) were set up in triplicate 
with and without DTP compounds (100 fiu) in a total volume of 
150 ^(L. The rate of NADH formation was measured at 340 nm for 
20 min at 37 °C using a Multiskan Spectrum plate-reader spectro- 
photometer (Thermo Scientific, Hemel Hempstead, UK) and was 
used to calculate the rate of production of UDP-glucose by WsGalE. 
Initial rates were calculated from the linear section of each pro- 
gress curve and the activity expressed as a percentage of the activ- 
ity in the absence of the compound. 

The binding of the DTP compounds was measured using a thermal 
scanning fluorimetry assay (44). This assay has been used previ- 
ously to identify small molecular chaperones for the treatment of 
phenylketonuria (45), to identify stabilizing additives that facilitate 
crystallization (46), and to measure the binding affinities of car- 
bonic-anhydrase inhibitors (47). Sypro orange (Sigma, Poole, UK), a 
fluorescent dye, was diluted from a 5000x solution (manufacturer's 
concentration definition) into a 50 x solution with 10 mM HEPES, 
pH 8.8. This stock solution was mixed well prior to each use. Reac- 
tions were carried out in a total volume of 20 fd. and contained 
5 fM HsGalE, 100 /;m DTP compound, 10 mM HEPES, pH 8.8, 1% 
(v/v) DMSO, 5x Sypro orange. Melting curves were determined 
using a Rotor-Gene Q cycler (Qiagen, Crawley, UK) with the follow- 
ing protocol: high-resolution melt run (460-nm source, 510-nm 
detector), 25-95 °C ramp with a 1 °C rise for each step, and no 
gain optimization. The melting temperatures (7~ m ) were calculated 
using the inbuilt analysis software. 
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Results and Discussion 

Evidence of simulation equilibration 

Alpha carbon RMSD plots of each homodimer trajectory are shown 
in Figure 2. As the initial protein conformation of each system 
was that of the crystallographic UDP-galactose-bound state, the 
first 9 nseconds of each homodimer simulation were discarded to 
account for system equilibration. The remaining 50 nseconds of 
the dimeric simulation were used for subsequent analysis. In total, 
400 nseconds of productive 7ibGalE monomer simulation were 
generated. 



A gating mechanism may mediate ligand 
binding 

To better understand how receptor dynamics might impact ligand 
binding, PCA was used to identify the most important molecular 
motions of each of the four simulations. Principal components, or 
eigenvectors, were first calculated for the backbone atoms of 
each trajectory, and each trajectory was subsequently projected 
onto the apo eigenvectors for reference. A majority of the vari- 
ance in the molecular motions could be explained by the first 
two principal components, with almost 40% of the variance 
explained by the first eigenvector alone (Figure 3A). Consequently, 
only the first two principal components were used for subsequent 
analysis (Figure 3B). 

The Interactive Essential Dynamics computer package (38) was 
used to identify the extreme structures of each simulation (Fig- 
ure 3C,D), as judged by projection onto the first apo eigenvector. 




0 10 20 30 40 50 60 

Time (ns) 



Figure 2: Trajectory RMSD. Each dimer simulation was aligned 
to the first frame by minimizing the root-mean-square deviation 
(RMSD) of the Ca's. RMSD was calculated using the first frame as 
a reference. For this plot, and in subsequent figures, apo is 
depicted in black, UDP-galactose in blue, UDP-glucose in red, and 
the UDP-ketose intermediate in brown. As each substrate was 
derived from UDP-galactose, the systems needed 9 nseconds to 
equilibrate (gray box). Trajectory analysis was performed on the 
subsequent 50 nseconds of each dimer simulation. 



The largest differences in the motions defined by the first princi- 
pal component were present in the loop region containing resi- 
dues 230-255. This loop was already thought to be highly 
dynamic, given that residues 236-248 were unresolved in the 
2CNB crystal structure (20). 

Much of the difference in the minimum extreme structures can be 
explained by a kink that forms at the C-terminal end of the a-helix 
preceding the loop containing residues 230-255 in the apo simula- 
tion (Figure 3E). This helical bend causes the flexible loop to move 
toward the active site, as evidenced by the change in the angle 



A B 




Figure 3: Principal component analysis (PCA). To facilitate com- 
parison, all trajectories were projected onto the principal compo- 
nents of the apo simulation. Apo is shown in black, UDP-galactose 
in blue, UDP-glucose in red, and the UDP-ketose intermediate in 
brown. (A) Percent of the trajectory dynamics attributable to each 
of the top 25 apo eigenvectors. A majority of these motions can be 
explained by the first two principal components (PC). (B) Projection 
of the trajectories onto the first two apo eigenvectors. Each hob 
simulation explores its own unique set of motions. (C) The PC1 min- 
imum extreme structures for each simulation, color-coded as above. 
The location of gate is boxed in green. (D) The PC1 maximum 
extreme structures, color-coded as above. The location of gate is 
boxed in green. (E) The closed conformation evident in the PC1 min- 
imum apo structure results from a kink in the highlighted a-helix. 
The PC1 minimum structure is shown in cyan/gray, and the maxi- 
mum is shown in blue/black for reference. 
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defined by the alpha carbon atoms of D245, G228, and R109, which 
ranges from 67.69° to 36.01° as PC1 decreases, compacting the 
protein. Additionally, the corresponding distance between D245 and 
R 1 09 similarly changes from 29.81 to 16.55 A. The loop may there- 
fore serve as a gating mechanism likely relevant to ligand binding 
(Figure 3). As the gating loop containing residues 230-255 is near 
the active site, it is possible that the movement of this loop helps 
control the UDP-sugar rate of entry. 



UDP-sugar binding: induced fit vs. population 
shift 

The principal components suggest that both induced fit and confor- 
mational shift play a role in the binding of 7ibGalE to its natural 
substrates. The projection of the apo simulation onto the first two 
principal components demonstrates that the apo protein explores a 
large region of conformational space (Figure 3B). Similar projections 
of the remaining simulations onto the same apo principal compo- 
nents reveal that the UDP-glucose and UDP-galactose simulations 
sample distinct regions within this larger, apo conformational space, 
suggesting a population shift mechanism of binding. On the other 
hand, the UDP-ketose intermediate samples a region of conforma- 
tional space largely unexplored by the apo protein; the intermediate 
itself may therefore provoke receptor conformational changes via an 
induced-fit mechanism. Limited conformational overlap was 
expected as the ligands only differ at the UDP-sugar C4 atom; 
these overlaps likely represent the transitions between states dur- 
ing the actual reaction. 

These results suggest a general binding mechanism that may be 
relevant to the study of other receptor-ligand systems as well. The 
apo protein likely samples a large region of conformational space, 
occasionally assuming conformations in which the gating loop is 
open, permitting access to the main, UDP-sugar binding site. When 
UDP-glucose binds, it stabilizes certain conformations, causing the 
region of the sampled conformational space to constrict via a popu- 
lation shift mechanism. Next, the bound ligand induces changes in 
the receptor conformation uncharacteristic of the apo protein via an 
induced-fit mechanism. These conformational changes are likely 
caused by the transformation of the sugar and are required to 
accommodate the UDP-ketose intermediate. As the epimerization 
progresses, the conformational space sampled by the protein 
returns to that sampled in the apo state, again indicative of a pop- 
ulation shift mechanism of binding. 

Remarkably, the regions of the apo conformational space sampled 
by the UDP-galactose- and UDP-glucose-bound proteins are distinct, 
despite the fact that these two ligands differ by only a chiral inver- 
sion at a single sugar carbon atom, demonstrating that even small 
differences in bound ligands can drastically change the region of 
conformational space sampled. This finding has significant implica- 
tions for computer-aided drug design, as it suggests that the crys- 
tallography and MD-derived structures used in virtual screening 
can vary remarkably depending on the bound ligand. As previous 
computer-aided drug discovery efforts directed toward 7iGalE only 
used holo simulations to generate receptor structures for virtual 
screening (12), additional screening against apo conformations may 
reveal new, effective 7£GalE inhibitors. 



Hydrogen bonding of the UDP-sugar 

To examine how the hydrogen bond network of the ligand changes 
over the course of the reaction, we first identified persistent hydro- 
gen bonds present in over 75% of the trajectory frames analyzed. 
As expected, the hydrogen bond network that mediates the binding 
of the UDP-ketose intermediate differs from the networks associ- 
ated with UDP-galactose and UDP-glucose (Figure 4A-C). Two of 
the identified hydrogen bonds mediate interactions with Y173 and 
S142, conserved residues of the catalytic triad that are critical for 
the two-step GalE epimerization reaction (Figure 1) (11,48). 

The hydrogen bond with S142 is particularly interesting, as earlier 
studies of fcGalE suggested that S142 shuttles a proton to Y173 (49). 
However, crystallographic studies of several GalE proteins have called 
into question this theory, as they reveal that the distance between 
the S142 and Y173 hydroxyl groups is generally large. Our results sug- 
gest that the specific role of S142 is to stabilize the UDP-ketose inter- 
mediate via hydrogen bonding with the C4 hydroxyl group during the 
conformational flip required for epimerization (Figure 4B). 

The UDP-sugar hydrogen bond networks were generally similar 
regardless of which monomer was used for analysis, with the excep- 
tion of UDP-galactose. When acceptor-donor heavy-atom distances 
were calculated over the course of the trajectories (Figure 4D-F), it 
was noted that the chain B active site of UDP-galactose underwent a 
conformational shift approximately 5 nseconds into its trajectory, 
causing a break in the hydrogen bonds between the UDP-galactose 
and S142 and Y173, respectively. However, this shift was not 
observed in chain A, which had a UDP-sugar hydrogen bond network 
more consistent with the 7iGalE crystal structure, 2CNB. Additionally, 
despite the conformational shift evident in chain B, the key sugar con- 
tacts with S142 and Y173 continued to transiently reform during the 
course of the simulation, confirming that these residues play an 
important role in stabilizing the UDP-sugar. Other residues, such as 
N202 and H221, also formed hydrogen bonds with the UDP-sugar, but 
more transiently (data not shown). 

Like S142 and Y173, L102 may also be important for catalysis, war- 
ranting further pharmacological study. Our simulations suggest that 
L102 may be fundamental to the conformational flip required for epi- 
merization (Figure 1). The carbonyl oxygen atom of L102 is predicted 
to form a persistent hydrogen bond with the C6 hydroxyl group of the 
UDP-ketose intermediate. Previous crystallographic studies of a UDP- 
galactose-like ligand suggested that L102 might mediate ligand bind- 
ing through interactions with the UDP-sugar C6 hydroxyl group (20). 
Our simulations suggest that an additional function of the L102 back- 
bone carbonyl group is to maintain the UDP-ketose-intermediate in a 
conformation approximately halfway between that of UDP-galactose 
and UDP-glucose via a persistent hydrogen bond. The average UDP- 
sugar [C1-0-P-0] dihedral angle of the UDP-ketose intermediate is 
approximately halfway between that of UDP-galactose and UDP-glu- 
cose (Figure 4G), supporting this notion. 

The same hydrogen bond networks that mediate UDP-sugar binding 
may be germane to the design and optimization of potential 
small-molecule therapeutics. For example, a common practice in 
designing inhibitors is to create transition-state analogs. Our simu- 
lation of the UDP-ketose intermediate bound to 7Z>GalE has eluci- 
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Figure 4: Ligand hydrogen bonding. Hydrogen bonds present in 
over 75% of the 1000 equidistant frames extracted from each MD 
simulation are shown in green. In all charts, UDP-galactose is 
shown in blue, UDP-glucose is shown in red, and the UDP-ketose 
intermediate is shown in brown. For D-F, chain A is represented by 
the darker color, and chain B by the lighter color. (A) UDP-glucose. 
The glucose is held in place by two hydrogen bond interactions 
with Y173. (B) UDP-ketose intermediate. The orientation of the 
sugar is flipped from that of UDP-glucose. The sugar C6 hydroxyl 
group interacts with L102, and the intermediate carbonyl oxygen is 
tethered to S142. (C) UDP-galactose. The now epimerized sugar, still 
flipped, forms a single hydrogen bond with Y173. (D-F) Moving 
average using 500 pseconds windows. (D) Distance from the L102 
carbonyl oxygen to the C6 hydroxyl on the UDP-sugar. (E) Distance 
from the S142 side chain hydroxyl to the C4 hydroxyl on the UDP- 
sugar. (F) Distance from the Y173 side chain hydroxyl to the C4 
hydroxyl on the UDP-sugar. (G) [C1-0-P-0] torsion angle of the 
UDP-sugar. C1 belongs to the sugar. 



dated the important role L102 may play in stabilizing this intermedi- 
ate. These results suggest that L102 may be a good residue to tar- 
get in future drug discovery efforts, in addition to the already 
identified Y173 and S142 (20). 



Identification of conserved residues 

As current trypanocidal compounds are subject to growing resis- 
tance (50-54), future drug design strategies should also attempt to 
anticipate mutations that may reduce therapeutic efficacy. One 
strategy to avoid resistance is to pharmacologically target protein 
residues that are conserved across related members of the same 
protein family. Conserved residues are likely to be essential for 
catalysis and/or the binding of natural substrates and thus are less 
likely to undergo resistance-conferring mutations. 

After considering the 392 reviewed members of the sugar epimer- 
ase family listed in UniProt, five representative proteins spanning 
both the eukaryotic and bacterial domains of life were selected: 
Trypanosoma brucei GalE (PDB ID: 2CNB), a chloroplastic protein 
from Arabidopsis thaliana (UniParc: Q8H124), GaM 0 from Schizosac- 
charomyces pombe (UniParc: Q9HDU3), bifunctional polymyxin resis- 
tance protein ArnA from Yersinia enterocolitica (UniParc: A1JPN5), 
and probable rhamnose biosynthetic enzyme 1 from Arabidopsis 
thaliana (UniParc: Q9SYM5). The sequences of these five proteins 
were aligned, and the following 7MalE active site residues were 
found to be highly conserved: G7, G10, 112, D75, A100, N117, S141, 
S142, A143, Y173, K177, and R268. Of these, S142, A143, Y173, 
K177, and R2B8 appear to associate with the UDP-sugar. 

A143 is particularly noteworthy. While A143 did not form a persis- 
tent hydrogen bond with the UDP-sugar C3 hydroxyl group in our 
simulations (i.e., the bond was present in fewer than 75% of all 
simulation frames analyzed), this hydrogen bond did form tran- 
siently, consistent with the 2CNB crystal structure. As S142 and 
A143 form hydrogen bonds with adjacent UDP-sugar hydroxyl 
groups, concurrent binding likely serves to stabilize the conforma- 
tion of the UDP-sugar in a conformation amenable to catalysis. 
Thus, small-molecule inhibitors that have adjacent hydroxyl groups 
or similar hydrogen bonding moieties might be good candidates tar- 
geting these two conserved residues. 



Allosteric binding 

In a recent study, Durrant et al. (12) used virtual screening to identify 
7ibGalE ligands from among the compounds of the NCI Diversity Set II. 
Fourteen low-micromolar inhibitors were ultimately reported. How- 
ever, in addition to these antagonists, seven agonists, unreported at 
the time, were also identified (Table S1). We found this result inter- 
esting, as the agonists were identified by docking small-molecule 
models into the 7£GalE active site and therefore should compete with 
the UDP-sugar substrate rather than enhance its catalysis. 

Although 7MialE agonists are ill suited as HAT therapeutics, a 
human condition known as type III galactosemia arises from a 
deficiency in WsGalE. Understanding GalE agonism is therefore of 
great therapeutic interest. To determine whether agonism might 
arise from allosteric binding, the hob and apo monomer trajectories 
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were clustered into five groups using RMSD cutoffs of 0.7 and 
0.75 A, respectively (Figure 5A). A representative protein conforma- 
tion was then taken from each cluster; together, these representa- 
tive conformations are said to constitute an ensemble. 

The FT-MAP server (55) was then used to computationally flood the 
entire surface of the ensemble conformations with models of small 
organic probes in an attempt to identify potential binding pockets. 
FT-MAP identified the NAD(H) and UDP-sugar binding pockets in 
every structure; however, no other potential allosteric pockets were 
consistently recognized (Figure 5B), suggesting that agonism does 
not likely occur by allosteric modulation. 

While the possibility of allosteric binding at an uncharacterized sec- 
ondary site could not be ruled out, the fact that FT-MAP revealed 
no such site suggests cooperative agonism mediated by binding to 
the principal site. It may be that agonist binding to one 7ibGalE 




Figure 5: Binding pocket identification. (A) Active site clustering 
of the 7Z)GalE monomers. The frames of each simulation were clus- 
tered by the active site Ca's using 0.70 and 0.75 A RMSD cutoffs 
for the holo and apo simulations, respectively. The top two repre- 
sentative structures of chain A from each simulation are shown; 
apo is depicted in black, UDP-galactose in blue, UDP-glucose in red, 
and the UDP-ketose intermediate in brown. Darker and lighter col- 
ors correspond to the most populated and the second most popu- 
lated cluster, respectively. (B) FT-MAP analysis. The top five clusters 
from each chain were submitted to the FT-MAP server. Shown are 
the results for the top chain A cluster of each simulation. These 
results suggest that 7ibGalE contains no allosteric sites; the 
observed agonism likely results from ligand-induced dimerization 
and/or cooperativity between the two monomers. 



monomer alters the affinity of the binding site on the other dimeric 
monomer, thus facilitating additional binding. The steep Hill coeffi- 
cients associated with several of the inhibitors, even in the pres- 
ence of a detergent that disrupts colloidal aggregates, support this 
theory (56,57). Hemoglobin is the classic example of cooperative 
binding, but many other examples exist in nature as well. As an 
alternate explanation, agonist binding to the principal site of one 
7iGalE monomer might drive dimerization by stabilizing the four- 
helix bundle at the dimer interface. This theory is supported by pre- 
vious evidence in related proteins suggesting that, while the mono- 
mer is functional, full functionality is achieved only through 
dimerization (58,59). 

Unfortunately, when the identified 76GalE agonists were tested 
against f/sGalE, no agonism was noted (Figure SI, Tables S2 and 
S3). However, it is possible that the lack of activity arose from dif- 
ferences in the human and T. brucei versions of the GalE protein 
rather than fundamental differences in the agonistic mechanism. 
For example, the NAD + adenine-binding pocket is more closed in 
7ibGalE, and WsGalE G237 in the UDP-sugar binding domain is 
replaced by C266 in FfiGalE, a potentially reactive residue that may 
prove useful in future drug design efforts (7 7). Further efforts to 
identify WsGalE agonists may therefore be justified. 

Conclusions and Future Directions 

In this work, we have used molecular simulations to probe the dynam- 
ics of F/jGalE and to specifically analyze the mechanisms governing 
ligand-binding and enzymatic conversion. Our MD simulations suggest 
that the conformations sampled by 7ftGalE are highly dependent on 
the composition of the ligand, as even the chirality of the UDP-sugar 
C4 atom greatly affected the conformations explored. 

Additionally, we have identified an active site residue, L102, that 
may be important in the stabilization of the UDP-ketose intermedi- 
ate. While this residue has been previously identified in crystallo- 
graphy studies as a potential mediator of UDP-galactose binding 
(20), to our knowledge, its possible role in stabilizing the UDP-ke- 
tose intermediate has not been previously recognized. 

Finally, as none of our simulations revealed any 7feGalE allosteric 
sites, we anticipate that dimeric agonism likely results from either 
cooperative binding or dimeric stabilization. We hoped that the 
results presented here will not only provide insight into the function 
of this and related enzymes, but also assist future computer-aided 
drug discovery efforts targeting 7iGalE and HsGalE. 
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